On the basis of two-speaker spontaneous conversations, it is shown that the distributions of both pauses and speech-overlaps of telephone and face-to-face dialogues have different statistical properties. Pauses in a face-to-face dialogue last up to 4 times longer than pauses in telephone conversations in functionally comparable conditions. There is a high correlation (0.88 or larger) between the average...
This paper describes the SpexKit framework for the development of spoken dialogue systems, which is currently used to implement prototypes of a bilingual city information system. We sketch the overall architecture of this speech platform, its dialogue manager and its scripting language, as well as the integration of speech technology components like ASR or TTS systems.
This paper is about the automated production of dialogue models. The goal is to propose and validate a methodology that allows the production of finalized dialogue models (i.e. dialogue models specific for given applications) in a few hours. The solution we propose for such a methodology, called the Rapid Dialogue Prototyping Methodology (RDPM), is decomposed into five consecutive main steps, namely:...
Using voice to access on-line information from the web would be very useful, given the proliferation of mobile devices that allow Internet access anytime and anywhere. However, a vocal interface is sequential and not persistent, so the information must be restructured to achieve efficient and natural interaction. Our proposal is based on converting original web contents...
The central module of any natural language dialogue system is the dialogue manager, which plays the role of an intermediate agent between the user and the information source. Its cooperativity and portability largely determine the efficiency of the dialogue system. Therefore, as the basis for cooperativity of information-providing dialogue systems we propose a knowledge representation of the information...
This paper focuses on improving visual Czech speech synthesis. Our aim was the design of a highly natural and realistic talking head with a realistic 3D face model, improved co-articulation, and a realistic model of inner articulatory organs (teeth, tongue, and palate). Besides very good articulation, our aim was also the expression of facial gestures and emotions by the talking head. The intelligibility...
In this study, the usability of two versions of a web-based electronic literature list and information system for blind and visually disabled people was evaluated. Because of the access possibilities of the target group, applicability for a speech-controlled interface (screen reader, speech-controlled web browser) was one point of interest. Furthermore, there was a focus on the integration of different...
Unreliable speech recognition, especially in noisy environments, and the need for more natural interaction between man and machine have motivated the development of multimodal systems using speech, pointing, gaze, and facial expressions. In this paper we present a new approach to fusing multimodal information streams using agents. A general framework based on this approach that allows for rapid application...
Two sets of linguistic features are developed: the first estimates whether a single step in a dialogue between a human and a machine is successful or not; the second classifies dialogues as a whole. The features are based on part-of-speech (POS) labels, word statistics, and properties of turns and dialogues. Experiments were carried out on the SympaFly corpus, data from a real application...
We present a logical approach to spoken language understanding for a human-machine dialogue system. The aim of the analysis is to provide a logical formula, or a conceptual graph, by assembling concepts related to a delimited application domain. This flexible structure is gradually built during an incremental parsing, which is meant to combine syntactic and semantic criteria. Then, a contextual understanding...
Discourse in formal domains, such as mathematics, is characterized by a mixture of natural language and embedded formal expressions. Based on an investigation of a collected corpus of informal dialogues on naive set theory proofs, we are developing a dependency-based lexicalist grammar for parsing input with different degrees of verbalization of the mathematical content: ranging from symbolic alone...
Where have we been and where are we going? Three types of answers will be discussed: consistent progress, oscillations and discontinuities. Moore’s Law provides a convincing demonstration of consistent progress, when it applies. Speech recognition error rates are declining by 10× per decade; speech coding rates are declining by 2× per decade. Unfortunately, fields do not always move in consistent...
We present a new approach to determining the meaning of words in text, which relies on assigning senses to the contexts within which words occur, rather than to the words themselves. A preliminary version of this approach is presented in Pustejovsky, Hanks and Rumshisky (2004, COLING). We argue that word senses are not directly encoded in the lexicon of a language, but rather that each word is associated...
I will first sketch some background on the company ScanSoft. Next, I will discuss ScanSoft’s products and technologies, which include digital imaging and OCR technology, automatic speech recognition technology (ASR), text-to-speech technology (TTS), dialogue technology, including multimodal dialogues, dictation technology and audiomining technology. I will sketch the basic functionality of these technologies,...